PCX 



WORLD INTELLBCTUAL PROPERTY ORGANIZATION 



Internationa! Bureau 

INTERNAXrONAL APPLICATION PUBUSHED UNDER TOE PATENT COOPERATION TREATY (POT) 



(51) IntemationBl Patent dassi&catkm ^ : 

H04L 12/54 



(II) International Publication 
(43) Internationa! Publication 



WO 00/42749 

20 July 2000 (20,07.00) 



(21) International Application Number: PCTJSBS^mm 

(22) bilemational Filing Date: 28 December 1999 (28.12.99) 



14 January 1999(14.01.99) 



(71) Applicant: TELEFONAKHEBOLAGET LM ERICSSON 
(publ) [SE/SEl; S-I26 25 Stockholm (SE). 

(72) Inventors: LUNDQVIST, Stellan; Djupbficksvagen 28, JP903 
42 Umeg (SE). OHLSSON. Mattias; Istidsgatan 45, S^06 
55 Ume2 (SE). NYGREN, JUirsaii R&ttnECsplanadcn 18d, 
S-SW3 28 Umea (SE). 

(74) Agait: KLAS, Narin; Encsson Radio Systems AB, Common 
Patent Dept, S-164 80 Stockholm (SE). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CR, CU, CZ, DE. DK, DM, EE. 
ES. FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, 
KE. KG, KP. KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, 
MD, MG, MK. MN, MW, MX. NO. NZ, PL. PT, RO, RU, 
SD, SB, SG, SI, SK, SL, TS, TM, TR, IT, TZ, UA, UG, 
UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, LS. 
MW, SD, SL, SZ, IZ. UG. ZW). Eurasian patent (AM, AZ, 
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, 
CH, CY, DE, DK, ES, FI, FR. GB, GR, IE, IT, LU, MC, 
NL. PT, SE), OAPI patent (BF. BJ, CF, CQ. Q. CM, GA, 
ON, GW, ML, MR, NE, SN, TD, TO). 



With international search report. 

B^ore the expiraHcn of the time limit for amending the 
claims and to be repuUished in the event of the receipt of 



(54)Tme: ADAPTIVE JrilfaR BUFFERING 



Application 



Jitter 
Control 



CPU 



Receiving Node 



Jitter 
Buffer 



Variance 
Buffer 



' 10 



'20 



Memory 



(57) Abstract 

In a packet coimnunication system, the delay time needed in a jitter buffer is detennined, mabling a smooth data feed to an 
application witiiout excessive delays, by methods and apparatus that vary the size of the jitter buffer based ob an estimated variation of 
paeltet transinission delay derived from the times of arrival of stored packets. A variance buffer stores variances of the times of anrival of 
stored packets, and the estimated variation of packet ttansmis^on delay is derived from the stored variances. Hie size of the jitter bnffer 
can changed preferentially during periods of discontinuous packet transmission. 
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ADAPTIVE JFTTER BUFFERING 



BACKGROUND 

This invention relates to electrical teieoonfimunicatlon and more particulariy to 
5 packet networks using the Internet Protocol and even more particulariy to mmimizing 
delays in packet delivery In such networks. 

Applications sending reat-time data streams over unreliable Internet Protocol (IP) 
networks have a tot of problems to overcome, induding long and variable delays and 
lost and out-of-sequence packets. Today, these problems can be reduced by using 

1 0 techniques such as the Real Time Protocol (RTP) and jitter buffers. 

The RTP is a real-time transport protocol that provides end-to-end network 
transport functions suitable for applications transmitting real-time data, such as audio, 
video, or simulation data, over multicast or unicast network services. The RTP does 
not address resource reservation and does not guarantee quality-of- service for 

1 5 real-time services. The RTP provides for sequence numbering, which tells the 

receiving node if the packets are aniving in sequence or at all. The data transport is 
augmented by a control protocol (RTCP) to allow monitoring of the data delivery in a 
manner scalable to large multicast networks, and to provide minimal control and 
Identification functionality. The RTP and RTCP are designed to be independent of the 

20 underiying transport and network layers. The RTP is specified in H. Schulzrinne et al., 
Request for Comments 1 389 "RTP: A Transport Protocol for Real-Time Applications", 
http://194.52.182.96/rfc/rfc1889.html (Feb. 1, 1996). 

Jitter buffers are memories in receiving nodes that are used for sorting the 
packets into the con-ect sequence, and delaying the packets as needed to compensate 

25 for variations in network delay. The RTP specification discusses such interamval jitter 
In Section 6.3.1 and Appendbc A.8 that provide for fomiing a 32-bit estimate of the 
statistical variance of the RTP data packet interarrival time, measured in timestamp 
units and expressed as an unsigned integer. The interarrival jitter J is defined to be the 
mean deviation (smoothed absolute value) of the difference D in packet spacing at the 

30 receiver compared to the sender for a pair of packets. As shown in the equation below, 
this is equivalent to the difference In the "relative transit time" for the two packets; the 
relative transit time is the difference between a packet's RTP timestamp and the 
receiver's ctock at the time of anival, measured in the same units. If Si is the RTP 
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P^^*^^* ^ 's the time of arrival in RTP timestamp units for packet 
i, then for two packets i and j, D may be expressed as: 

D(i. j) = (Rj . Ri) - (Sj - Si) = (Rj - Sj) - (Ri - Si) 
The interarrival jitter is caiculated continuously as each data packet i is received from 

5 the source, using this difference D for that packet and the previous packet 1-1 in otxier 
of arrival (not necessarily in sequence), according to the formula: 

J = J + (lD(i-1,i)i.J)/16 
This algorithm is the optimal first-order estimator and the gain parameter 1/16 gives a 
good noise reduction ratio while maintaining a reasonable rate of convergence. 

0 The problem today is detenninlng the delay time needed in the jitter buffer to 

achieve a smooth data feed to the application, without excessive delays. This problem 
can seriously affect voice communication using the internet/Intranet as the backbone 
for transmitting the speech. In additbn, the need for smarter use of network bandwidth 
will become more and more important as the number of users of IP telephony 

5 increases. 

In the communications between a mobile cellular telephone and a radio base 
station, it is common to employ a technique called discontinuous transmission (DTX) 
mainly to save battery power in the mobile. Briefly stated, DTX means that a 
transmitter does not send any data if it does not have any new data to send. When a 
0 mobile station detects that the user is not speaking, the mobile station sends only 
Silence Descriptor (SID) frames that contain background noise. The SiD frames are 
sent periodically, generally about every 480 millseconds. and between the SID frames 
the mobile station sends nothing. 

The idea of using DTX to save bandwidth has been brought up in the IMTC 
Voice over IP Forum Technical Committee (V61P 1A 1.0), but no implementations have 
yet been discussed. There are also some special cases that need to be handled when 
using DTX over an IP network. 

SUMMARY 

Applicants' invention solves the problem of determining the delay time needed in 
a jitter buffer and achieves the object of obtaining a smooth data feed to an application, 
without excessive delays. Thus, Applicants' inventbn improves voice communication 
using the Intemet/lntranet as the backbone for transmitting the speech and uses 
networic bandwidth more Intelligently. 
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fn one aspect of the invention, there is provided a receiving node in a packet 
communication system that Includes a jitter buffer that has a variable size, that stores 
paci^ets aniving at the receiving node, and that releases stored packets to an 
application executing in the receiving node, wherein each packet has a respective 
5 sequence number, stored packets are released periodically, and each entry in the jitter 
buffer has one of a plurality of states; and a processor that varies the size of the jitter 
buffer based on an estimated variation of packet transmission delay derived from the 
times of anival of stored packets. 

The receiving node may further indude a variance buffer that stores variances of 

1 0 the times of arrival of stored packets, and the time that the first-amved packet is 

released is based on the time of arrival of the first packet and the initial delay, and the 
estimated variation of packet transmission delay is derived from the stored variances. 

The states of the jitter buffer entries may be free, busy, and used, the free state 
indicating that no amved packet is stored at that location in the jitter buffer, the busy 

1 5 state Indicating that an an-ived packet Is stored at that location in the jitter buffer, and 
the used state Indicating that an arrived packet stored at that location is being released 
to the application. Arrived packets may then be stored in respective locations that are 
maiiced in the busy state; packets may be released in response to queries by the 
application; and when the application queries the jitter buffer for a next packet, that 

20 packet's respective location may be changed to the used state and the respective 
location of the previously amved packet may be changed to the free state. 

The processor may decrease the size of tlie jitter buffer while the receiving node 
is in a discontinuous transmission mode, thereby avoiding discarding arrived packets 
that hold speech information. The receiving node may then include a DTX buffer that 

25 stores selected packets aniving at the receiving node. An aniving packet is selected 
based on at least one of whether the aniving packet is first to anive after a speech 
period and holds total noise Infomiation and whether the aniving packet contains noise- 
update infonnation, arrives after a speech period, and has a respective sequence 
number that Is subsequent to the sequence number of an eariier arriving packet holding 

30 speech Information. The processor then dianges the size of the jitter buffer while 
packets are being selected, thereby avoiding discarding packets holding speech 
information. 

In another aspect of the Invention, there is provided a method of storing in a 
buffer packets aniving at a receiving node in a packet communication system and 
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releasing arrived packets to an application executing in the receiving node. The 
method includes the steps of determining a time Tr to release a first arrived packet to 
the application, the time Tr being the first packet's arrival time Ta plus an initial delay, 
while vi/aiting for the first anived packet to be released from the buffer, comparing a 
5 current time to the time Tr and releasing the first anived packet when the time Tr has 
passed, and after the first anived packet is released, releasing stored packets 
periodically at first intervals. 

The comparing may be performed in response to queries from the application 
that occur periodically at second intervals, stored packets anived after the first anived 

1 0 packet may be released in response to queries from the application that occur 
perlodtaally at the first intervals, and the first interval may be at least as long as the 
second Interval. Also, the first interval may be substantially equal to transmission 
Intervals between aniving packets. 

In a further aspect of the Invention, there is provided a method of adapting a size 

15 of a buffer that stores packets aniving at a receiving node in a packet communication 
system. The method includes the steps of: counting a number of arrived packets 
having sequence numbers lower than that of an oldest anived packet stored in the 
buffer; comparing the number to an accepted loss parameter, if the number is greater 
than the accepted loss parameter, increasing a change indicator counter and if the 

20 number is equal to or less than the accepted loss parameter, decreasing the change 
indicator counter; increasing the size of the buffer when the change indicator counter 
reaches an indicator roof parameter If the buffer is not already at its largest pemiitted 
size; and decreasing the size of the buffer when the change Indicator counter reaches 
an indicator floor paranreter if the buffer is not already at its smallest pennitted size. 

25 The step of determining the size of the buffer may be performed by detennining 

an expected arrival time of a packet in relation to an anrival time of a first packet of a 
packet sequence; determining an arrival time variance for the packet; determining a 
measured delay that Is a time the packet will be delayed in the buffer determining a 
desired delay based on the arrival time variance and the accepted loss parameter; and 

30 detennining the size of the buffer based on the desired delay and the measured delay. 
The anivai time variances may be stored in a variance buffer and sorted and 
nomialized. Aiso, measured delays may be accumulated for packets having anivai 
time variances stored in the variance buffer, and the desired delay may be determined 
based on the sorted, normalized anivai time variances and the accepted loss 
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parameter. The size of the buffer is then determined based on the desired delay and 
an average measured delay derived from the accumulated measured delays. 

The size of the buiTer rnay be decreased while the receiving node is in a 
discontinuous transmission mode, thereby avoiding discarding arrived packets that hold 
speech information. The method may then include the step of storing in a DTX buffer 
selected packets arriving at the receiving node. An aniving packet is selected based 
on at least one of whether the arriving packet is first to arrive after a speech period and 
holds total noise infonnation and whether the arriving packet contains noise-update 
Information, arrives after a speech period, and has a respective sequence number that 
is subsequent to the sequence number of an eariier arriving packet holding speech 
information. The size of the buffer is then changed while packets are being selected, 
thereby avoiding discarding packets holding speech information. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention and its objects and advantages win be understood by readirjg this 
description in conjunction with the drawings, in which; 

FIG. 1 illustrates a packet header format; 

FIGS. 2A, 2B illustrate a receiving node having a jitter buffer; 

FIG. 3 illustrates a method of storing and releasing packets In the per buffen 

FIG. 4 illusfrates a method of determining when to change the size of a jitter 

buffer; 

FIG. 5 illustrates a method of determining a size change of a jitter buffer; 
FIGS. 6A, 6B illusfrate a buffer for storing packet anival time variances; 
FIG, 7k illustrates measured delays for packets in the jitter buffer, 
FIG. 7B illustrates a principle behind the buffer size change detennination; and 
FIGS. 8A. 88 illustrate operation of a jitter buffer with discontinuous packet 
transmission. 

DETAILED DESCRIPTION 

Applicants' invention solves the pnsbiem of determining the delay time needed in 
a jitta* buffer to adiieve a smooth data feed to an application, without excessive delays. 
Applicants' solution needs only an initial delay value to be provided, after which it 
adapts itself to a suitabJe delay by measuring arrival time variations and a number of 
packets arriving too late. Applicant's solution is based on an assumption that the 
transmitter sends the data packets at Intervals that are known to the receiver, e.g., 
regular intervals. 
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In accordance with Applicants' invention, an adaptive jitter buffer stores data 
packets aniving at a node over the IP network and liandles data packets that arrive late 
or out of sequence. The transmitter sends the data packets over the network using a 
protocol such as the RTP that provides for a respective sequence number in each 
5 packet, which tells the receiving buffer in what sequence the arriving packets should be 
entered into the buffer. 

As an example of a useful protocol, the header format of RTP packets is 
illustrated by FIG. 1, which indicates bit positions and octet numbers across the fop. 
Each header comprises at least twelve of octets organized into the following fixed 
10 header fields: 

version (V): 2 bits 

padding (P): 1 bit 

extension (X): 1 bit 

contributing source (CSRC) count (CC): 4 bits 
15 marker (MJ: 1 bit 

payload type (PT): 7 bits 
sequence nymber 16bits ■ 
timestamp: 32 bits 

synchronizatton source (SSRC): 32 bits 

20 CSRC fist: 0 to 15 items, 32 bits each 

The first twelve octets are present In every RTP packet, while the list of CSRC 

is present only when Inserted by a RTP mbcer. The details of the fixed 
header fields are described in Section 5.1 of the RTP specification. It is sufficient to 
note here that the PT field Identifies the format of the RTP payload and determines its 

25 interpretation by the application that is to use the payload. A profile specifies a default 
static mapping of payload type codes to payioad fonnats. Additional payload type 
codes may be defined dynamically. An RTP sender emits a single RTP paytoad type at 
any given time. 

The sequence number increments by one for each RTP data packet sent, and 
30 may be used by the receiver to detect packet ioss and to restore packet sequence. 
The initial value of the sequence number Is random (unpredictable) to make 
known-plaintext attacks on encryption more difficult, even if the source itself does not 
encrypt, because the packets may flow through a translator that does. It will be 
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appreciated, therefore, that it is not necessary for the transmitter to use tiie RTP but 
only to provide suitable sequence numbers in the packets. 

In accordance with Applicants' invention, the receiving node determines times to 
release arrived packets from an adaptive jitter buffer to an application. The 
arrangement of a receiving node 1 is depicted highly schematically in FIG. 2A and the 
arrangement of the jitter buffer 10 in the receiving node 1 fe depicted in mor^ detail in 
FIG. 2B. The node 1 receives a stream or sequence 2 of arriving pacltets that are 
provided to a processor CPU in the receiving node. As Illustrated in FIG. 2A, the 
processor executes the instructions that make up the application to which the packets 
are directed as weli as the instmclions that mak&up the methods of controlling the jitter 
butfer 10 and, if provided, a variance buffer 20 that are described in more detail below. 
The buffers 10, 20 reside in a memory provided in the receiving node 1 . 

FIG. 2B shows a sequence of incoming data packets 5, 6, 7, . . . that are stored 
in respective locations in the jitter buffer 10 as indicated by the arrow A. Already 
an-ived packets are released from the buffer 10 to the application as indicated by the 
an-ow B. FIG. 28 depicts a sltuatkin in which already arrived packets 3. 4 have already 
been stored in locations in the buffer 10. The locations In the buffer 10 are identified as 
either free, used, or busy for reasons that are explained bekiw. 

FIG. 3 illustrates the process of storing incoming packets and releasing arrived 
packets to an application. One important aspect of this method is ttie calculation of a 
time Tr to release the first anived packet to the application (step 302). In essence, this 
time is the first packet's anival time Ta plus a specified initial delay that is an initial 
estimate of a desired delay Td, which is determined as described below. 

While the application waits for data to be released from the jitter buffer 1 0, the 
application may query the buffer periodically, at short intervals (step 304). As long as 
the application is not given a data packet by the jitter buffer, the applicatfon does not do 
anything. Each time the application queries the buffer for the first data packet, the 
buffer compares the current time t to the release time Tr of the first packet (step 306). 
It will be appreciated that further packets. I.e., packets arrwing after the first arrived 
packet, can arrive during steps 304, 306 before the first packet has been released. 
After the release time has passed, the buffer gives the first data packet to the 
application the next time the application sends a short-interval or "fast" query to the 
jitter buffer 10 (step 308). or perhaps more precisely the processor in the receiving 
node that controls the jitter buffer 10. 
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^^^^ tl^® fj^t packet is given to the application, it is preferable that no more time 

comparisons are done when releasing packets, incoming packets are stored in the 
jitter buffer (step 310) as described below, and packets are given to the application 
whenever it queries for them (steps 312, 314). These queries for more data can arrive 
at the jitter buffer 1 0 with inten/als between them that are substantially the same as or 
longer than the intervals between the "fasr queries (j.e., these queries are "slow" 
compared to the queries for the first arrived packet). The time Intervals between the 
slow queries need not be less than substantially the transmit Intervals between the 
packets, which as noted above are known to the receiver. In a simple communicatton 
system, the packets are transmitted at regular intervals, i.e., the transmit intervals are 
substantially equal to each other. In fact, the time intervals between the slow queries 
are preferably substantially the same as the packet transmission inten/als. 

It will be appreciated that the fast and slow queries need not arise from the 
application, but more generally can be any signals, e.g.. from a timer or timers, that can 
cause the first amVed packet and/or subsequently anived packets to be released to the 
application. 

Referring again to FIG. 2B, each jitter buffer entry can be in one of three 
different states: free, busy, or used. The free state means that no arrived packet is 
stored at that location In the buffer; the busy state means that an anived packet is 
stored at that location; and the used state means that the arrived packet stored at that 
locafron Is being released to or accessed by the application. Packets are released from 
the jitter buffer 10 In accordance with the value of a read pointer that indicates which 
buffer location to access as each query Is received fnDm the application. It will be 
understood that the read pointer is in essence nothing more than a recirculating 
counter, with each count value corresponding to a respective location in the Jitter buffer. 

As packets are released from the jitter buffer to the application, the states of the 
entries change in the following way. The first packet actually anived is stored in a 
location that is mariced In the busy state, and the read pointer Is Initialized to that 
location. After the first arrived packet has been released to the application as 
described above, that location is changed to the used state. It is generally 
advantageous for the packet cunrently being accessed by the application (i.e., the 
buffer entry In the used state) to be treated as the first packet in the buffer 1 0. When it 
is time for the application to get the next packet the entry currently in the used state is 
changed to the free state and the next entry in the buffer (as Indicated by the read 
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pointer) is transformed from the busy state to the used state, if the next entry in the 
buffer is in the free state, no packet is given to the appiicatlon (since there is not an 
arrived paci^et stored at that location), and the read pointer indicating which buffer 
iocation to read the next time the application queries the buffer for a pacl<et is 
advanced. If the sequence number of an incoming packet is Iowa- than the sequence 
number of a used-state packet, the incoming packet is regarded as arriving too late and 
is discarded. 

Four parameters may be used advantageously for configuring Applicants' 
method of adaptffig the luehavior of tiie jitter buffer to changing communication 
conditions; A Sampling Interval is a number of data packets to measure over before a 
buffer see change calculation is perfomied. An Acceptable Loss is a number of data 
packets the loss of which due to delay can be accepted during one Sampling Interval 
before changing the size of the buffer. Indirator Roof and Indicator Floor parameters 
are used for controlling the sensitivity of the method. These and other parameters 
employed in Applicants' methods can generally be changed as desired at anytime. 

These parameters and a Change Indicator counter are used in Applicants' 
method of determining when to change the buffer size that is illustrated by the flow 
chart of FIG. 4, which begins with setting the parameters and initializing the Change 
Indicator counter to zero (step 402). This method can be executed from time to time at 
the prompting of the applicatton receiving the packets, but it is currently believed to be 
preferable for tiie method to run continuously as packets are received. 

the jitter buffer 10 stores incoming packets in respective memory locations (step 
404), and checks whether the buffer has received the number of packets specified by 
the Sampling Interval parameter (step 406). When the number of received packets is 
greater than the Sampling Interval parameter, the number of packets aniving too late, 
i.e, the number of airiving pad^ets having sequence numbers tower than that of the 
packet being accessed by the application (i.e., the buffer entry in ttie used state), is 
read from a Lost Packets counter (step 40B). The "Lost Packets" count includes only 
packets that are delayed, not packets that are lost. The Lost Packets counter Is 
updated as each packet is received after being initialized to zero at the start of a 
sampling interval corresponding to the Sampling interval parameter. 

The Lost Packets count for the sampling interval is compared to the Accepted 
Loss parameter (step 410). If the Lost Packets count Is greater than the Accepted Loss 
parameter, the Change Indicator counter is increased by one (step 412). If the Lost 
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Packets count is equal to or less than the Accepted loss parameter, the Change 
Indicator counter Is decreased by one (step 414). It can be advantageous in some 
circumstances for the Change Indicator counter not to be decreased when the Lost 
Packets count is equal to the Accepted Loss parameter. Such circumstances include 
for example when the application requires more caution for decreasing the jitter buffer 
size. Packets are discarded when the size of the jitter buffer is decreased, so more 
caution is usually appropriate to avoid excessively discarding packets when there are 
rapid up/down changes in the network transmission delay. If this is done, the Accepted 
Loss parameter used in the method depicted in FIG. 4 must not be zero. 

When the Change Indicator counter reaches the indicator Roof parameter (step 
416). it Is time Id Increase the size of the jitter buffer 10, provided the buffer is not 
already at its largest pemiitted size (step 418). When the Change Indicator counter 
reaches the Indicator Floor parameter (step 420), it is time to decrease the size of the 
buffer (step 422), provided the buffer is not already at its smallest permitted size. It is 

i currently believed that the largest buffer size, which corresponds to the longest delay In 
the jitter buffer, is dependent on the application. In addition. It can be noted that the 
longest delay In the jitter buffer is the same as the longest desired delay Td if the 
Accepted Loss parameter is zero. For example, two-way voice or video communication 
could find a one-second delay unacceptable but such a delay and even longer delays 

I couid be acceptable for data file transfers and one-way video communication. It Is 
currently believed that the smallest buffer size would typically be one packet, Le., the 
shortest delay Td would typically be the packet transmission Intenrai. It is conceivable 
that the smallest buffer size could be zero packets, i.e., packets could be released 
immediately upon arrival (Td = 0), but that would require a communication network 

• having iittle if any variance in transmission delay. 

Once it is detemiined that the size of the jitter buffer 1 0 should be changed by 
the method depicted in FIG. 4, the new size of the buffer (step 41 8 or step 422) can be 
determined by the method illustrated by FIG. 5, which begins as the method depicted in 
FIG. 4 with the jitter buffer 10 storing incoming packets in respecUve memory locations 
(step 502). Here, it is not necessary to check whether the buffer has received the 
number of packets specified by the Sampling Interval parameter, although this could be 
done if desired. 

During the sampling interval, the an-ival time of each packet is compared to the 
arrival time of the first packet of this packet sequence. By adding the product of the 
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packet transmission interval and the difference between the sequence numbers of 
successive packets to the arrive! time of the first packet, the expected arrival time of a 
particular packet in relation to the an-ival time of the first packet can be determined 
{step 504). This expected arrival time Ta„ of the packet having sequence number n is 
5 given by the following expression: 

Ta-t^(N„-NO + Ta,- 

where Ta, is the arrival time of the first packet, tl Is the packet transmission interval, N„ 
is the sequence number of the currently aniving packet, and N, is the sequence 
number of the first packet Instead of using the arrival time of the first packet in the 

10 sequence. the method can use the anival time of the first packet in the current 
sampling Interval. Also as part of step 504, measured delays are accumdated as 
explained In more detail below. 

An arrival time variance v for packet n is determined according to the following 
expression when the packet n anives: 

15 v = Tactual„ - Ta„ 

where Tactuain is the actual arrival time the packet anives. in accordance with one 
aspect of the invention, this variance may be stored in the variance buffer 20 (step 
506). The buffer 20 for storing the variances is preferably separate from the jitter buffer 
1 0 and has a size corresponding to the same length of the sampling interval, so that 

20 variances are stored one by one until the buffer 20 is full (step 508). As depicted by 
FIG. 6A, the first entry in the buffer 20 represents the first packet of this sampling 
'hterval. and the last entry represents the last packet of this sampling interval. The 
variance entries in the buffer 20 are sorted and nomnalized (step 510) such that the 
smallest value is zero as depicted in FIG. 6B. 

25 It will be appreciated that in general it is not necessary to use a variance buffer 

20 and that the desired delay, i.e., the size of the jitter buffer, can i^e determined as 
each packet anrives from each packet's respective variance v. Thus, the processes of 
steps 506, 508, 510 may be consWered, In a way, as operating on a single variance, 
i.e., that of one of the arrived packets. 

30 Based on ttie contents of the buffer 20 or on an individual variance as just 

described, the desired delay Td can be determined (step 512). The example depicted 
in FIG. 6B shows ttiat the variance in arrival times is seventeen time units, This means 
that if the Accepted Loss parameter is set to zero (meaning no packets can be lost), 
then the desired delay Td in the jitter buffer 10 during this sampling Interval is 
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seventeen time units. If the Accepted Loss parameter is set to one (meaning one 
packet can be lost), then the desired delay Td In the jitter buffer during this sampling 
Interval is thirteen time units. The desired delay Td is given in general by the following 
expression: 

5 Td = buffer(Sampling Inten/al - Accepted Loss) 

if the buffer 20 uses a 1 -based Indexing mechanism in which the first entry in the buffer 
is indexed as one, the second entry is indexed as two, etc. 

During the sampling inten/al an accumulated measured delay can be maintained 
as noted above in connection with step 504. The measured delay is the time the 
10 current aniving packet will be delayed In the buffer, as illustrated by RG. 7A. The 
measured delays for the packets in the sampling interval can be accumulated as the 
packets anive for deriving an average measured delay Md that is used as described 
below. 

In accordance with Appiicants' invention, the desired delay Td and the measured 
1 5 delay Md are used for detemnining the size of any necessary jitter buffer size change 
(step 514). FIG. 7B graphically describes how the current size of the jitter buffer 10 can 
be changed by (Td/2 - Md) time units without causing aniving packets (Ta) to be 
considered late. 

Packet arrival time variances are represented in FIG. 7B on the horizontal axis. 

20 Ta is the time variance in when the application requests new data packets. There will 
be no or negligible variance in Ta when the application requests packets at regular 
intervals. The lines Dl and Du represent the lower and upper limits of the range of 
variances In packet anival time for packets in the sampling inten/al, and the short 
vertical lines between Dl and Du represent variances for individual packets. It will be 

25 seen that Du is the variance for the packet selected as the desired delay Td in the 
preceding expression. If the range Dl-Du includes all variances during a sampling 
interval, then Td Is that for an Accepted Loss value of zero. Md is the average 
measured delay, that can be obtained by accumulating the measured delays ibr 
packets that actually arrived during this sampling interval, i.e.. packets amved both on 

30 time (in the range Dl-Du) and too late, and dividing by that number of packets. 

The purpose of step 514 is to move Dl as close to Ta as possible, i.e.. to 
minimize the measured delay in the jitter buffer without losing packets, according to the 
following expression: 

Delay Modification = Td/2 - Md 
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The Delay Modification value tells the number of time units tliat ttie size of the jitter 
buffer should be increased or decreased. This value is rounded up to the closest value 
that is a multiple of the packet transmission interval, and is then divided by the 
transmission interval to find the number of packets more or fewer needed in the jitter 
5 buffer. 

As indicated by step 51 6, the jitter buffer size is ihcrisased by denying the 
application newly aniving packets according to the number determined in step 514 and 
is decreased by discarding a number of packets according to the number detemnined in 
step 514. It will be understood that steps 51 0 through 51 6 con-espond to steps 41 8 and 

10 422. In this way, the size of the Jitter buffer Is adapted to the communication conditions 
e}dsting during the sampling interval. 

Applicants' method of adapting the jitter buffer size may be implemented 
advantageously in combination with DTX, which as explained above means that the 
transmitter does not send any packets when it does not have any new data to send. If 

1 5 this method is used, there will be periods of tinne when the jitter buffer does not receive 
any new data and it is therefore possible to decrease the buffer size without discarding 
packets. 

Another advantage to DTX with an adaptive jitter buffer is provided if the packets 
arise from a speech application, such as voice over an IP networic The adaptive jitter 

2D buffer can change its size from time to time, and when the buffer size decreases, some 
speech frames vAW be discarded. This can disturb the speech vocoder, distorting the 
speech. Nevertheless, discarding or losing packets during periods of DTX, e.g., 
silence, avoids the disturbance. As noted above, when a user is not speaking, a 
transmitter periodically sends only SID frames that contain background noise. In 

25 general, using DTX with an adaptive jitter buffer for a speech application requires 

storing the SID frames in a separate location in the menrary of the receiving node 1 , not 
in the jitter buffer 10. 

The following describes an implementation of DTX with an adaptive jitter bufiier 
in a communication system in accordance with the Global System for Mobile 

30 communication (GSM) standard. Such communication systems are well known In the 
art so they need not be desalbed In detail here. It will be appreciated that DTX may be 
employed when the packets hold information d;her than speech and noise information, 
which thus will be understood to mean more generally any first and second types of 
information used in the communication system employing DTX. 
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In DTX in a GSM system, the transmitter such as a mobile station generates two 
types of SID frames or packets and sends them to the radio base station (RBS), which 
may be a receiving node 1 as described above or which may simply forward the 
packets to a receiving node 1. One type of SID packet contains the total noise 
information and the other type of SID packet contains only an update of the noise. 
-Generally, a total-noise SID packet is sent first during a silence period, and atter that, 
noise-update SID packets are sent except in a situation explained in more detail below. 

The RBS may re-format packets received from a mobile station as RTP packets 
or the mobile station may produce such packets itself, but in any event, the payload of 
each RTP.packet holding speech or SID data, includes space for two flags: a SID flag 
that indicates whether the payload is holding speech or SID data, and a TAF flag that 
identifies the packet as either a total-noise SID packet or a noise-update SID packet. 
Thus, a node can detect the difference between a total-noise SID packet and a noise- 
update SID packet by examining the flags, or information elements, included in the 
packet. 

The RBS forwards toward the IP network the SID packets received frcim the 
mobile station, indicating by the two flags whether the payload is SID data and, if it is. 
whether the SID data is an update or total noise infomiation. Because the total-noise 
SID is so important and because speech data is sent as user datagram protocol (UDP) 
packets, the risk of losing the SID can be decreased by sending that packet more than 
once, either several times all together or for example when it nonnally occurs and when 
sending the next noise-update SID packet The UDP is an IP-standard protocol that 
enables an application program on a first processor to send datagrams (packets) to an 
application program on a second processor using the IP to deliver the packets. 

The RBS or other receiving node 1 detects whether a payload is holding speech 
or SID data, and if the packet is a SID packet, the packet is saved in an area of the 
receiving node's memory that is different from the jitter buffer 10 as noted above. Also 
as noted above, usually the first SID packet in a period of silence is very important 
since it holds the total infomiation of the background noise. Without this information, a 
vocoder in the receiving node would not be able Id reconstruct the noise. 

In current GSM systems, a transmitter like a mobile station needs a period of at 
least about twenty-four speech frames or packets to be able to prepare a SID packet 
holding the total noise information. Thus, if the transmitter, during a silence period, 
detects a short speech burst (e.g., a burst shorter than twenty-four speech packets), 
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the transmitter will send the last noise-update SID and not the total-noise SID after the 
short speech burst This is sometimes called a "hangover" case in the DTX standards, 
and is a case in which the receiving node should not move into a DTX mode, i.e., 
should not direct arriving (SID) packets to memory locations other than the jitter buffer. 
5 (See FIG. 8B.) Accordingly, there are two cases v/hen the receiving node 1 should 
move into a DTX mode, i.e., should direct arriving (SID) packets to memory locations 
other than the jitter buffer. (See FIG. 8A.) 

In the first case, the receiving node should move into DTX mode when the first 
aniving SID packet after a speech period holds the total noise information. In the 

1 0 second case, the receiving node should nrave into DTX mode when a noise-update 
SID packet arrives after a speech period and its sequence number is the next following 
or subsequent to the sequence number of a (eariier) speech frame. These two cases 
are illustrated by FIG. 8A, which depicts the jitter buffer 10 and three types of packets: 
DTX (SID) packets D, speech packets 8, and too-late, lost or not received packets X. 

1 5 From FIG. 8A, it can be seen that the packet D that arrived after the sequence of seven 
speech packets should be a noise-update SID packet because speech packets do not 
occur between a total-noise SID and a noise-update SID except in the "hangover case. 
Having moved into DTX mode, the SID packet arrived after the last speech packet is 
released to the application In due course. In a GSM system, the application will only 

20 send one particular SID packet to a receiving node like a mobile telephone once. The 
receiving node, when in DTX mode, moves out of DTX mode when a packet hoWing 
speech informatfon arrives. 

In cases other than the two depicted by FIG. 8A, packets should not be released 
to the application, i.e., when the node is receiving a noise-update SID and the last 

25 packet was lost. This results In a situation similar to the situation when speech packets 
are lost in an IP networi^ and is illustrated by FIG. 8B, from which it can be seen that 
the packet X that arrived l^sfore the packet D might have been a total-noise SID, 
making the packet D a noise-update SID. If such a case, i.e., when a total-noise SID 
has been lost, one should not have the receiving node move into the DTX mode 

30 because the received infonnation will be recovered pooriy. 

Regardless of whether the receiving node moves into DTX mode or not, the jitter 
buffer should not count lost packets during tiie time SID packets are received. In other 
words, it is currently believed that the methods illustrated by FIGS. 4, 5 should not be 
implemented while SID packets are aniving, except to the extent that the size of the 
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Jitter buffer advantageously can be changed based on previously arrived non-SID 
packets during DTX periods. 

Assuming pacicets are arriving at the RBS for transmission to a mobile station, 
the first SID packet at the start of a saence period is usually sent to the mobile station in 
5 a traffic channel established between the RBS and mobile station. All other SID frames 
during that silence period are usually sent in a control channel, in particular the slow 
associated control channel (SACCH). If the first SID frame Is a total-noise SID frame, it 
should only be sent once to the mobile station, but if the SiD frame that was last sent 
was a totat-nolse SID frame and if a noise-update SID frame has not yet an-ived and it 

1 0 is time to send a new SID frame to the mobile station in the SACCH , then there might 
be a problem. Accordingly, at this time, a noise-update SID frame holding information 
indicating no change in the noise should be sent in accordance with one aspect of 
Applicants' invention. Such a SID may be called a delta zero SID packet or frame, and 
the frame including a delta zero SID could be hard coded or generated on the fly (in 

1 5 real time) at the time when it is needed. 

It will be appreciated by those of ordinary skill in the art that this invention can be 
embodied in other specifte forms without departing from its essential character. The 
embodiments described above should therefore be considered in all respects to be 
illustrative and not restrictive. The scope of Applicants' invention is detemiined by the 

20 following claims, and all modifications that fall withHi that scope are intended to be 
Included therein. 
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WHAT IS CLAIMED IS: 

1. A receiving node In a pacl<et communication system, comprising: 

a jitter buffer tliat has a variable size, that stores pacl^ets arriving at the receiving 
node, and that releases stored packets to an application executing in the receiving 
5 node, wherein each packet has a respective sequence number, stored packets are 
released periodically, and each entry in the jitter buffer has one of a plurality of states; 
and 

a processor that varies the size of the jitter buffer based on an estimated 
variation of packet transmission delay derived from the times of arrival of stored 
10 packets. 

2. The receiwng node of claim 1 , further comprising a variance buffer that 
stores variances of the times of arrival of stored packets, and wherein a time that a first 
an-ived packet is released is based on a time of arrival of the first packet and an initial 
delay, and the estimated variation of packet transmission delay is derived from the 

1 5 stored variances. 

3. The receiving node of claim 1 , vi/herein the states of the jitter buffer 
entries are free, busy, and used, the free state indicates that no arrived packet is stored 
at that location in the jitter buffer, the busy state indicates that an arrived packet is 
stored at that location in the jitter buffer, and the used state Indicates that an anived 

20 packet stored at that location is being released to the application. 

4. The receiving node of claim 3, wherein an-ived packets are stored in 
respective locations that are marked In the busy state; packets are released in 
response to queries by the application; and when the application queries the jitter buffer 
for a next packet, that packet's respective location is changed to the used state and the 

25 respective location of the previously anived packet is changed to the free state. 

5. The receiving node of claim 1 , wherein the processor decreases the size 
of the jitter buffer while the receiving node is in a discontinuous transmission mode, 
thereby avoiding discarding anrlved packets that hold speech infomiation. 

6. The receiving node of claim 5. further comprising a DTX buffer that stores 
30 selected packets aniving at tiie receiving node; wherein an aniving packet is selected 

based on at least one of whether the aniving packet is first to anrive after a speech 
period and holds total noise information and whether the arriving packet contains noise- 
update infonnation, arrives after a speech period, and has a respective sequence 
number that is subsequent to tfie sequence number of an earlier aniving packet holding 
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speech information; and the processor decreases the size of the jitter buffer while 
packets are being selected, thereby avoiding discarding pacl<ets holding speech 
information. 

7. A method of storing in a buffer paclcets aniving at a receiving node in a 
pacl<et communication system and releasing arrived paclcets to an application 
executing in the receiving node, comprising the steps of: 

determining a time Tr to release a first anived packet to the appfication, wherein 
the time Tr is the first packet's amval time Ta plus an initial delay; 

while waiting for the first arrived packet to be released from the buffer, 
comparing a current time to the time Tr and releasing the first arrived packet when the 
time Tr has passed; and 

after the first arrived packet is released, releasing stored packets periodically at 
first intervals. 

8. The method of claim 7, wherein the comparing is peri^ormed in response 
to queries from the application that occur periodically at second intervals, stored 
packets arrived after the first anived packet are released in response to queries from 
the application that occur periodically at the first intenrals. and the first inten/al is at 
least as long as the second interval. 

9. The method of daim 7, wherein the first interval is substantially equal to 
transmission intervals between aniving packets. 

10. A method of adapting a size of a buffer that stores packets aniving at a 
receiving node In a packet communication system, comprising the steps of: 

counting a number of arrived packets having sequence numbers lower than that 
of an oldest arrived packet stored in the buffer; 

comparing the number to an accepted loss parameter; 

if the number is greater than the accepted loss parameter, increasing a change 
indicator counter and if the number is equal to or less than the accepted loss 
parameter, decreasing ttie change indicator counter; 

increasing the size of the buffer when the change indicator counter reaches an 
Indicator roof parameter if the buffer is not aiready at its largest pennitted size; and 

decreasing the size of the buffer when the change indicator counter reaches an 
indicator fkjor parameter if the buffer is not aiready at its smallest pemiitted size. 

1 1 . The method of daim 1 0, wherein the number is compared when the buffer 
has stored a number of packets specified by a sampling interval parameter. 
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12. The method of claim 10, further comprising the step of determining the 
size of the buffer by perforrning the steps of: 

determining an expected arrival time of a packet in relation to an anival time of a 
first packet of a packet sequence; 
5 determining an arrival time variance for the packet; 

determining a measured delay that js a time the packet will be delayed in the 

buffer; 

determining a desired delay based on the arrival time variance and the accepted 
loss parameter; and 

1 0 determining the size of the buffer based on tiie desired delay and the measured 

delay. 

13. The method of claim 12, wherein anival time variances are stored in a 
variance buffer, stored anival time variances are sorted and normalized, measured 
delays are accumulated for packets having anival time variances stored in the variance 

15 buffer, the desired delay is determined based on the sorted, normalized amval time 
variances and the accepted loss parameter, and the size of the buffer is detennined 
based on the desired delay and an average measured delay derived from the 
accurnulated measured d^ays. 

14. The method of claim 1 0, wherein the size of the buffer is decreased while 
20 the ^ece^/i^g node is in a discontinuous ftansmission mode, thereby avoiding 

discarding arrived packets that hold speech information. 

15. The m^hod of claim 14, farther comprising the step of storing in a DTX 
buffer selected packets arriving at the receiving node: wherein an aniving packet is 
selected based on at least one of whether the aniving packet is first to anive after a 

25 speech period and holds total noise infonnation and whether the arriving packet 

contains noise-update infonnation, arrives after a speech period, and has a respective 
sequence number that is subsequent to the sequence number of an eariier aniving 
packet holding speech infonnation; and the size of the buffer is decreased while 
packets are being selected, thereby avoiding discarding packets holding speech 

30 Information. 
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